Picture for Cheng Yang

Cheng Yang

Residual Decoding: Mitigating Hallucinations in Large Vision-Language Models via History-Aware Residual Guidance

Add code
Feb 01, 2026
Viaarxiv icon

Dual Latent Memory for Visual Multi-agent System

Add code
Jan 31, 2026
Viaarxiv icon

LocationAgent: A Hierarchical Agent for Image Geolocation via Decoupling Strategy and Evidence from Parametric Knowledge

Add code
Jan 27, 2026
Viaarxiv icon

Spatial-Agent: Agentic Geo-spatial Reasoning with Scientific Core Concepts

Add code
Jan 23, 2026
Viaarxiv icon

Graph-Anchored Knowledge Indexing for Retrieval-Augmented Generation

Add code
Jan 23, 2026
Viaarxiv icon

Data-centric Prompt Tuning for Dynamic Graphs

Add code
Jan 17, 2026
Viaarxiv icon

ToolACE-MCP: Generalizing History-Aware Routing from MCP Tools to the Agent Web

Add code
Jan 13, 2026
Viaarxiv icon

Beyond Static Tools: Test-Time Tool Evolution for Scientific Reasoning

Add code
Jan 12, 2026
Viaarxiv icon

InstructMoLE: Instruction-Guided Mixture of Low-rank Experts for Multi-Conditional Image Generation

Add code
Dec 25, 2025
Viaarxiv icon

Reasoning via Video: The First Evaluation of Video Models' Reasoning Abilities through Maze-Solving Tasks

Add code
Nov 19, 2025
Figure 1 for Reasoning via Video: The First Evaluation of Video Models' Reasoning Abilities through Maze-Solving Tasks
Figure 2 for Reasoning via Video: The First Evaluation of Video Models' Reasoning Abilities through Maze-Solving Tasks
Figure 3 for Reasoning via Video: The First Evaluation of Video Models' Reasoning Abilities through Maze-Solving Tasks
Figure 4 for Reasoning via Video: The First Evaluation of Video Models' Reasoning Abilities through Maze-Solving Tasks
Viaarxiv icon